Maintenance of the Prelarge Trees for Record Deletion

Authors

  • Chun-Wei Lin
  • Tzung-Pei Hong
  • Wen-Hsiang Lu

Abstract

The frequent pattern tree (FP-tree) is an efficient data structure for association-rule mining without generation of candidate itemsets. It must, however, process all transactions in batch. Besides record insertion, record deletion is also common in real applications. In this paper, we propose the prelarge-tree structure for efficiently handling record deletion, based on the concept of pre-large itemsets. Owing to the properties of pre-large itemsets, the proposed approach does not need to rescan the original database until a certain number of records have been deleted. It can thus achieve good execution time for tree construction, especially when only a small number of records are deleted each time. Experimental results also show that the proposed approach performs well when handling deleted records incrementally.

Keywords: data mining, FP-tree, prelarge-tree algorithm, pre-large itemsets, record deletion.
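The abstract does not state how many deletions can be absorbed before a rescan. As a rough illustration only, the sketch below assumes the usual two-threshold setup behind pre-large itemsets: a lower support threshold and an upper one, both hypothetical parameters here. Under that assumption, an itemset whose support is below the lower threshold in a database of d records cannot reach the upper threshold until more than d(upper - lower)/upper records have been deleted. The function name and arguments are illustrative, not taken from the paper.

```python
from fractions import Fraction
from math import floor


def deletion_safety_bound(num_records, lower, upper):
    """Largest number of deletions after which a previously small itemset
    still cannot be large, assuming lower/upper support thresholds.

    Illustrative helper (not from the paper): computes
    floor((upper - lower) * num_records / upper).
    """
    # Convert via str() so that decimal inputs such as 0.05 stay exact.
    lo, up = Fraction(str(lower)), Fraction(str(upper))
    if not (0 < lo <= up <= 1):
        raise ValueError("thresholds must satisfy 0 < lower <= upper <= 1")
    if num_records < 0:
        raise ValueError("num_records must be non-negative")
    return floor((up - lo) * num_records / up)


def needs_rescan(deleted_so_far, num_records, lower, upper):
    """True once the accumulated deletions exceed the safety bound."""
    return deleted_so_far > deletion_safety_bound(num_records, lower, upper)
```

With 1000 original records, a lower threshold of 3% and an upper threshold of 5%, the bound works out to 400 deletions. The closer the two thresholds, the sooner a rescan becomes necessary.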

Similar resources

Maintenance of prelarge trees for data mining with modified records

The frequent pattern tree (FP-tree) is an efficient data structure for association-rule mining without generation of candidate itemsets. It compresses a database into a tree structure that stores only large items. When data are modified, however, it needs to process all transactions in batch. In the past, the prelarge-tree structure was proposed to incrementally mine associatio...

The first record of Truncatella angustata from Chamaecyparis lawsoniana trees in the northeast of Iran

Drechslera triseptata, a new record for Iranian mycoflora

Quercus persica and Pistacia mutica are two known and prevalent forest trees of Baneh and Marivan from Kurdistan province (W Iran). Thirty-five fungal isolates were obtained from 50 bark samples of Q. persica collected in Sept. and Mar. 2010–11. The aim of this study was to identify fungi associated with oak trees of Kurdistan province. Based on morphological features and authentic keys (Ellis ...

Adaptive Approximate Record Matching

Typographical data-entry errors and incomplete documents produce imperfect records in real-world databases. These errors generate distinct records that belong to the same entity. The aim of Approximate Record Matching is to find multiple records that belong to one entity. In this paper, an algorithm for Approximate Record Matching is proposed that can be adapted automatically with input error...

Maintenance of Discovered Sequential Patterns for Record Modification

In the past, some researchers proposed efficient incremental mining algorithms for maintenance of sequential patterns as records were inserted or deleted. In addition to record insertion and deletion, record modification is also very commonly seen. Although maintenance of sequential patterns for record modification can be performed by usage of the deletion procedure and then by the insertion pr...

Publication date: 2007